3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY-SA
Size:
800 sentences Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:Harnessing the linguistic signal to predict scalar inferences
-
Paper track:Long/Discourse and Pragmatics
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sebastian Schuster | Scalar implicature prediction evaluation corpus | /N |
Documentation:
None
Modality Independent
Corpus,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
Freely Available
License:
CC0-1.0
Size:
1.01 GByte Production Status:
Newly created-finished
Use:
Knowledge Discovery/Representation
-
Paper title:MOOCCube: A Large-scale Data Repository for NLP Applications in MOOCs
-
Paper track:Short/NLP Applications
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jifan Yu | MOOCCube | /N |
Documentation:
Yes, the doc have Chinese and English version, and is now publicly available.
Written
Augmentation dataset,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
MIT
Size:
944 KByte Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:Syntactic Data Augmentation Increases Robustness to Inference Heuristics
-
Paper track:Short/Semantics: Textual Inference and Other Areas of S
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Junghyun Min | Syntactic Augmentaion NLI | /N |
Documentation:
https://github.com/Aatlantise/syntactic-augmentation-nli/blob/master/README.md
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
Released under OANC's license
Size:
227 MByte Production Status:
Existing-updated
Use:
Machine Learning
-
Paper title:Syntactic Data Augmentation Increases Robustness to Inference Heuristics
-
Paper track:Short/Semantics: Textual Inference and Other Areas of S
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Junghyun Min | Multi-Genre NLI Corpus | /N |
Documentation:
https://www.nyu.edu/projects/bowman/multinli/
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
MIT
Size:
3205 KByte Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:Syntactic Data Augmentation Increases Robustness to Inference Heuristics
-
Paper track:Short/Semantics: Textual Inference and Other Areas of S
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Junghyun Min | Heuristic Analysis for NLi Systems | /N |
Documentation:
https://github.com/tommccoy1/hans/blob/master/README.md
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Not specified
Size:
17260 sentences Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:How does BERT's attention change when you fine-tune? An analysis methodology and a case study in negation scope
-
Paper track:Long/Interpretability and Analysis of Models for NLP
-
Paper status:Accept - Shepherd
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yiyun Zhao | SFU Review corpus | /N |
Documentation:
http://www.lrec-conf.org/proceedings/lrec2012/summaries/533.html
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Open Source
Size:
18.8 MByte Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:How does BERT's attention change when you fine-tune? An analysis methodology and a case study in negation scope
-
Paper track:Long/Interpretability and Analysis of Models for NLP
-
Paper status:Accept - Shepherd
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yiyun Zhao | ConanDoyle-neg corpus | /N |
Documentation:
http://www.lrec-conf.org/proceedings/lrec2012/summaries/221.html
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
6000000 entries Production Status:
Existing-used
Use:
Explanations
-
Paper title:Make Up Your Mind! Adversarial Generation of Inconsistent Natural Language Explanations
-
Paper track:Short/Interpretability and Analysis of Models for NLP
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Oana-Maria Camburu | e-SNLI | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike (CC BY-SA)
Size:
40000 sentences Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Analyzing analytical methods: The case of phonology in neural models of spoken language
-
Paper track:Long/Interpretability and Analysis of Models for NLP
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Grzegorz Chrupała | MIT Flickr Audio Caption Corpus | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY-NC-SA 3.0
Size:
2869657 pairs OtherProduction Status:
Existing-used
Use:
Natural Language Generation
-
Paper title:Paraphrase Generation by Learning How to Edit from Samples
-
Paper track:Long/NLP Applications
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Amirhossein Kazemnejad | Twitter URL paraphrasing corpus | /N |
Documentation:
None




